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Abstract 

Identifying unambiguously the presence of a bubble in an asset price remains an unsolved problem 
in standard econometric and financial economic approaches. A large part of the problem is that the 
fundamental value of an asset is, in general, not directly observable and it is poorly constrained to 
calculate. Further, it is not possible to distinguish between an exponentially growing fundamental 
price and an exponentially growing bubble price. 

In this paper, we present a series of new models based on the Johansen-Ledoit-Sornette (JLS) 
model, which is a flexible tool to detect bubbles and predict changes of regime in financial markets. 
Our new models identify the fundamental value of an asset price and a crash nonlinearity from 
a bubble calibration. In addition to forecasting the time of the end of a bubble, the new models 
can also estimate the fundamental value and the crash nonlinearity, meaning that identifying the 
presence of a bubble is enabled by these models. Besides, the crash nonlinearity obtained in the 
new models presents a new approach to possibly identify the dynamics of a crash after a bubble. 

We test the models using data from three historical bubbles ending in crashes from different 
markets. They are: the Hong Kong Hang Seng index 1997 crash, the S&P 500 index 1987 crash 
(black Monday) and the Shanghai Composite index 2009 crash. All results suggest that the new 
models perform very well in describing bubbles, forecasting their ending times and estimating 
fundamental value and the crash nonlinearity. 

The performance of the new models is tested under both the Gaussian residual assumption and 
non-Gaussian residual assumption. Under the Gaussian residual assumption, nested hypotheses 
with the Wilks statistics are used and the p-values suggest that models with more parameters 
are necessary. Under non-Gaussian residual assumption, we use a bootstrap method to get type I 
and II errors of the hypotheses. All tests confirm that the generalized JLS models provide useful 
improvements over the standard JLS model. 

Keywords: financial bubbles, crash, prediction, fundamental value, nonlinearity, Wilks statistics, 
bootstrap 
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I. INTRODUCTION 



Financial bubbles are genera 
a fundamental value (2), Q, 



ly denned as transient upward accelerations of price above 
14|. Fundamental value reflects the intrinsic value (and is 
sometimes called this) of the asset itself. It is ordinarily calculated by summing the future 
incomes generated by the asset, which are discounted to the present. However, as the future 
income flow is uncertain and not known in advance, and since the interest rates that should 
be used to discount future cash flows are bound to change in ways not yet known at the 
time of the calculation, the fundamental value of the asset is usually hard to estimate. In 
this sense, identifying unambiguously the presence of a bubble remains an unsolved problem 



12|. 



in standard econometric and financial economic approaches 

The Johansen-Ledoit-Sornette (JLS) model 7H9J provides a flexible framework to detect 
bubbles and predict changes of regime in the price time series of a financial asset. It combines 
(i) the economic theory of rational expectation bubbles, (ii) behavioral finance on imitation 
and herding of investors and traders and (iii) the mathematical and statistical physics of 
bifurcations and phase transitions. The model considers the faster-than-exponential (power 
law with finite-time singularity) increase in asset prices decorated by accelerating oscillations 
as the main diagnostic of bubbles. It embodies a positive feedback loop of higher return 
anticipations competing with negative feedback spirals of crash expectations. Our group has 
made many successful predictions using JLS model, such as the 2006 - 2008 oil bubble [151 ]. 
the Chinese index bubble in 2009 [6j, real estate market in Las Vegas [20J], South African 
stock market bubble Ji| and US Repos market 18]. We also have recently developed new 
methods based on this model for forecasting rebounds of the stock market rather than 
crashes 

In this paper, we generalize the standard JLS model by inferring fundamental value of 
the stock and crash nonlinearity from bubble calibration. The new models can not only 
detect the crash time but also estimate the fundamental value and the crash nonlinearity. 
This means that our new model has the ability to identify the presence of a bubble, thereby 
addressing the problem stated at the beginning of this paper. With the estimated fun- 
damental value, another famous unsolved problem becomes easier: distinguishing between 
an exponentially growing fundamental price and an exponentially growing bubble price. 
Furthermore, the new models can also detect the dynamics of crash after the bubble by 
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specifying how the price evolves towards the fundamental value during the crash. 

We test the models using data from three historical bubbles from different markets that 
ended in significant crashes. They are: the Hong Kong Hang Seng index 1997 crash, the 
S&P 500 index 1987 crash (black Monday) and the Shanghai Composite index 2009 crash. 
All results suggest that the new models perform very well in describing bubbles, forecasting 
their ending times and estimating fundamental value and the crash nonlinearity. 

The performance of the new models is tested under both the Gaussian residual assumption 
and non-Gaussian residual assumption. Under the Gaussian residual assumption, nested 
hypotheses with the Wilks statistics are used and the p-values suggest that models with 
more parameters are necessary. Under non-Gaussian residual assumption, we use a bootstrap 
method to get type I and II errors of the hypotheses. All tests confirm that the generalized 
JLS models provide useful improvements over the standard JLS model. 

The paper is constructed as follows. In Section [Til we introduce the standard JLS model 
and our new generalized JLS models. We then analyze three historical bubbles with the 
new models in Section IIII1 In Section IIVI we compare the generalized models statistically to 
confirm that these new models provide useful improvements over the standard JLS model. 
We conclude in Section IVl 



II. JLS MODELS 



Standard JLS model 



In the JLS model 



7H9J, the dynamics of a given asset is described as 

— = n{t)dt + a(t)dW - ndj, (1) 
V 

where p is the asset price, [i is the drift (or trend) and dW is the increment of a standard 
Wiener process (with zero mean and unit variance). The term dj represents a discontinuous 
jump such that j = before the crash and j ' = 1 after the crash occurs. The loss amplitude 
associated with the occurrence of a crash is determined by the parameter k. Each successive 
crash corresponds to a jump of j by one unit. The dynamics of the jumps is governed by a 
crash hazard rate h(t). Since h(t)dt is the probability that the crash occurs between t and 
t + dt conditional on the fact that it has not yet happened, we have E t [dj] = 1 x h(t)dt + 
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x (1 — h(t)dt) and therefore 

E t [dj] = h(t)dt. (2) 

Under the assumption of the JLS model, noise traders exhibit collective herding behaviors 
that may destabilize the market. The JLS model assumes that the aggregate effect of noise 
and fundamental traders can be accounted for by the following dynamics of the crash hazard 
rate 

h(t) = B'(t c - t) m - 1 + C(t c - t) m - 1 cos(u; ln(t c - t) - 0') . (3) 

If the exponent m < 1, the crash hazard may diverge as t approaches a critical time t c , 
corresponding to the end of the bubble. The second term in the r.h.s. of ([3]) takes into 
account the existence of a possible hierarchical cascade of panic acceleration punctuating 
;he course of the bubble, resulting either from a preexisting hierarchy in noise trader sizes 



211 ] and/or the interplay between market price impact inertia and nonlinear fundamental 
value investing 5]. 

The no-arbitrage condition reads E t [dp] = 0, which leads to fi(t) = nh(t). Taking the 
expectation of ([1]) with the condition that no crash has yet occurred gives dp/p = p(t)dt = 
nh(t)dt. Using the crash hazard rate defined in ([3]) and integrating yields the so-called 
log-periodic power law (LPPL) equation for the price: 

\np(t) = F LPPL (t) , (4) 

where 

^LPPLit) = A + B(t c - t) m + C(t c - t) m cos(u; ln(t c -t)-<f>), (5) 



B = -nB'/rn and C = -kC /Vm 2 + u 2 . Note that this expression (T4]) with ((5]) describes 
the average price dynamics only up to the end of the bubble. The JLS model does not 
specify what happens beyond t c . This critical t c is the termination of the bubble regime and 
the transition time to another regime. For < m < 1, the crash hazard rate accelerates up 
to t c but its integral up to t, which controls the total probability for a crash to occur up to t, 
remains finite and less than 1 for all times t < t c . It is this property that makes rational for 
investors to remain invested knowing that a bubble is developing and that a crash is looming 
3, |9[. Indeed, there is still a finite probability that no crash will occur during the lifetime 
of the bubble, including its end. The excess return = nh(t) is the remuneration that 
investors require to remain invested in the bubbly asset, which is exposed to a crash risk. 
The condition that the price remains finite at all time, including t c , requires that m > 0. 



Within the JLS framework, a bubble is identified when the crash hazard rate accelerates. 
According to Q, such accelerates occur when m < 1 and B' > 0, hence B < since m > 
by the condition that the price remains finite. We thus have a first condition for a bubble 
to occur: 

< m < 1 . (6) 

This condition is the mathematical embodiment of our definition of a financial bubble, 
characterized by a faster-than-exponential growth as time approaches the critical time t c . 
Indeed, it is straightforward to verify that the first-order and higher-order derivatives of the 
log-price diverge at t c , in contrast with their finiteness for the standard exponential price 
model. 

By definition, the crash rate should be non-negative. This imposes [if] ] 

b = -Bm-\C\Vm 2 + u 2 >0 . (7) 

B. Modified JLS models 

In an effort to study the fundamental price, we modify and generalize the JLS model as 
follows. We now write the price dynamics of an asset as 

dp = /i(t)pdt + a(t)pdW - n(p - pi^dj, (8) 

where the first two items of the right hand side define the standard geometrical Brownian 
motion and the third term is the jump. 

When the crash occurs at some time t* (implying dj = 1), the price drops abruptly 
by an amplitude n(p(t*) — pi) 1 ■ 

The motivations and the interpretation of the three parameters p%, k and 7 are as follows. 

• For k = 7 = 1, the price drops from p(t*~) to p{t* + ) = p\, i.e., the price changes from 
its value just before the crash to a fixed well-defined valuation p%. In the spirit of 
Fama's analysis of the 19 October 1987 crash |l|], if one interprets the asset price after 
the crash as the "right" price, i.e., the price discovery towards rational equilibrium 
without mispricing, the crash is nothing but an efficient assessment by investors of the 
"true" or fundamental value, once the panic has ended. Hence, p\ can be interpreted 
as the fundamental price which is discovered during the crash dynamics. 
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• Then, k can be thought of as a measure of market efficiency, that is, 1 — k is the 
relative inaccuracy of the discovery of the fundamental price by the market. If, say, 
k = 0.5, this means that the price has dropped by only half of its bubble component, 
and remains over- valued compared with its fundamental component. 

• When different from 1, the exponent 7 can be interpreted as embodying a nonlinear (i) 
over-reaction for small variations and under- reaction for large deviations (0 < 7 < 1) 
or (ii) under-reaction for small variations and over-reaction for large deviations (7 > 1) 
from the fundamental value. 

Since pi is a fixed parameter, the generalized JLS model implies that we should measure 
the price dynamics in the frame moving with the fundamental price. In other words, p\ is the 
fundamental price at the beginning t\ of the time period over which the bubble develops. 
In order to compare in a consistent way the realized price to this fixed parameter, it is 
necessary to discount the asset price continuously by the rate of return of the fundamental 
price. If Pobs(^) denotes the empirical price observed at time t, this means that the price 
p(t) that enters in expression (JE]) is defined by 

* 1 

p(t)= Pohs (t) II 7: 7T^> (9) 

s=tl+ i (l + r/(s))365 

where rf(s) is the annualized growth (risk free) rate of the fundamental price. In our 
empirical analysis, we will take for rf(s) the annualized US 3-month treasury bill rate. 
Applying again the no-arbitrage condition E t [<ip] = to expression flB]) leads to 

^{t)p = K{p-p 1 yh{t) . (10) 

Conditional on the absence of a crash, the dynamics of the expected price obeys the equation 

dp = jj,{t)pdt = k(p — piyhifjdt , (11) 

and the fundamental price must obey the condition p\ < minp(t). For 7 = 1, the solution 
of equation ffTTj) generalizes (@J into 



\n[p(t) - Pl ] = F LPPL (t) , (12) 
where FhPPLit) is again given by expression ([5]). For 7 e (0, 1), the solution is 

( P - Pl y-~< = F LPPL (t) , (13) 



where again ThPPhif) is given by expression (jSJ). We do not consider the case 7 > 1 which 
would give an economically non-sensible behavior, namely the price diverges in finite time 
before the crash hazard rate itself diverges. 

In summary, we shall consider four models M , M l5 M 2 and M 3 , where some are nested in 
others. The goal will be to then apply statistical tests to the models to determine which are 
sufficient or not and which are necessary or not. In the following models, J^LPPiit) below is 
given by expression (jSJ). 

0. Original JLS model M : p\ — 0, 7 = 1 (with k < 1): 

PM (t) = exp(J r LPPL (t)) . (14) 

1. M x : Pl ^0,7= 1: 

PMi(t) = Pi + exp(J r LPPL (t)) . (15) 
Mi includes M as a special case. In other words, M is nested in M\. 

2. M 2 : Pl = 0, 7 e (0,1]: 

/ (F LP PL(t))^ , 7 e(0,i), 

PM 2 {t) = < (16) 
[ exp (^LPPLit)) ,7 = 1. 

Since M 2 includes M as a special case, M is also nested in M 2 . 

3. M 3 : ^^0,7 6(0,1]: 

m J Pi + (FLPPL(t))^ , 7 e (0,1) , 
Pm 3 (*) = S (17) 

^ pi + exp \ J~LPPLif)) , 7 = 1 • 

M3 includes all previous models, Mq, M\ and M 2 as special cases, so that Mq, M\ and 
M 2 are all nested in M3. 

III. CALIBRATION AND RESULTS ON THREE HISTORICAL BUBBLES 
A. Calibration method of the models 



Given an observed asset time series of prices {p bs{t)}, we first transform it into a price 
time series of discounted prices {p(t)} by using expression ([9]). We next determine the three 
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parameters A, B and C in expression (jSj) for each model as a function of the other parameters, 
by solving analytically the system of three linear equations obtained by minimizing the 
square of deviations: 

• ln[p(t)] - T LPPL {t) for M , 

• ln[p(t) -vi] - J^LPPiit) for M u 

• b(t)] 1 - 7 - J^LPpdt) for M 2 and 

• W) ~ PiY' 1 ~ FLPPLit) for M 3 . 

We then determine the other parameters for each model using a Taboo search (to find initial 
parameter estimates) coupled with a Levenberg-Macquardt algorithm. We constrain the 
values of plausible parameters as follows: 

1. the fundamental price p\ should be larger than 0.2p min , where p m i Q := Min[p(t)] over 
the fitting time interval. 

2. The fit parameters t c , m, pi and 7 should not be on the boundary of the search inter- 
vals. They should deviate from these boundaries by at least 1% in relative amplitude. 

3. Among all the fits satisfying the above two conditions, the one with the smallest sum 
of normalized residuals is selected. The cost function we use here is the sum of squares 
of the relative discounted price differences 

«=^, (18) 

where puif) stands for one of the expressions (I14H17I) . 

The critical time t c corresponding to the end of the bubble is searched in [t 2 ; t 2 + 0A(t 2 — 
where the time window of analysis is [ti;^]- The exponent m is constrained in [10 -5 ; 1 — 
10~ 5 ]. The log-angular frequency u> is searched in [0.01; 40]. The phase can take values in 
[0, 2tt — 10~ 5 ]. The fundamental price p\ is in [0.01; 0.99p m i n ] and then restricted by condition 
(i) above. 
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B. Results 

We calibrate models M — M 3 to three well-documented bubbles, which ended in large 
crashes: 

• Hong Kong Hang Seng index (HSI) (*i = Feb. 1, 1995, t 2 = March 13, 1997), 

• S&P 500 index (GSPC) (ti = Sept. 1, 1986, t 2 = Aug. 26, 1987), 

• Shanghai Composite index (SSEC) (t x = Oct. 24, 2008, t 2 = July 10, 2009. 

1. Presentation and discussion 

The results are shown in Figs. [T]- E] and the corresponding parameters are given in Tables [I] 
- IIHI Visually, all models seem to perform similarly, with the determined critical times t c 
close to the true time of the crash. We note that the parameters pi and 7 in Mi,M 2 and 
M3 depart significantly from their reference values p± = and 7 = 1 characterizing model 
M . 

Model Mq corresponds to model M with a slightly different cost-function. Instead of 
minimizing the sum of the squares of terms given by ffTHj) . for t going from t\ to t 2 , the 
parameters of Mq are those of model Mo obtained by minimizing the sum of the squares of 
the difference ln[p Mo (t)] — FLPPLif)- Since lny — lnx = (y — x)/x + 0[(y — x)/x} 2 , the two 
methods should give similar results and the results summarized in tables Hl lIIII confirm this 
expectation. 

Results of detailed statistical comparisons between the four models are shown below. 
Tables Hl lIIII suggest that the five models perform almost equivalently in their ability to fit 
the price accelerations and to determine the time t c of the peak of the bubbles. One can 
note a remarkable stability and consistency of the estimators for the two crucial parameters, 
the exponent m and the angular log- frequency u. However, models Mi and M 3 provide an 
interesting estimation of the size of the bubble, which appears stable with respect to these 
two specifications: at the beginning of the calibration interval, for the Hong Kong bubble, 
models Mi and M 3 estimate that the bubble component might have been already accounting 
for 71% to 80% of the observed price. At the end of the bubble, the bubble component is 
between 85% to 90% of the observed price. Similar values are found for the two other case 
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studies. An exception is for the Shanghai Composite index bubble, for which model M3 
suggests that the fundamental price was 92% of the observed price at the beginning of the 
calibrating interval and about half of the observed price at its peak. 

The models provide a method to measure the amplitude of the crash that follows the 
bubble peak. Consider two types of drawdown after the peak: (i) DD[ 2m0 nths] is the two- 
months drop measured from the peak; (ii) DD m3jX is the peak-to-valley drawdown from the 
peak to the minimum of the asset price after the crash. We calculate the magnitude of the 
crash compared to the over-valued prices as follows. The ratio between the crash magnitude 
and over-valued prices is estimated as: 

Rd = ^—^ -j- i e {[2months],max}. (19) 

Pobs(t P ) -Pi]l s=il+ i(l + r/(s))365 

During the crashes, the hazard rate in Eq. [TTj) should be 1. Then comparing the definition of 
RC and Eq. (ITTj) . one can easily find that k = RC for the models whose 7 = 1 (M , Mi, Mq). 
For the other models, k is different from RC. These values are reported in tables HllHIl 



2. Consistency test of the calibrations 
According to the specification of 



11| . we should verify that the calibrations discussed 
above are self-consistent, i.e., the residuals are stationary. This verification step was pro- 
posed by U] as a possible solution to the problems identified by js]] and [l3] resulting from 



the calibration of non- stationary prices. 

In order to check that the normalized residuals are stationary for all the four models, 
we use the Phillips-Perron and the Dickey-Fuller unit root tests. The null hypothesis Hq is 
that the normalized residuals are not stationary, i.e. they have a unit root. In order to have 
reasonable statistics, we consider time windows of fixed length of 175, 250 or 550 trading 
days. We identify these windows in time series much larger than the (£1,^2) intervals used 
to identify the bubbles (given at the top of Sec. IIIIBj) . The interval lengths correspond to 



the different values of £2 — ti for the respective case studies. We choose overlapping intervals 
with the start of neighboring intervals separated by 25 days. There are 303 windows of size 
550 trading days for the HSI from Jan. 1, 1987 to Feb. 25, 2010; 800 windows for the GSPC 
index from Feb. 2, 1954 to Feb. 10, 2010 with size of 250 trading days; 167 windows of SSEC 
from Aug. 3, 1997 to Jan. 22, 2010 of size of 175 trading days. Note that we choose these 
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dates as the window boundaries because: (i) the chosen (ti,t%) intervals identified at the top 
of Sec. IIIIBI should be one of the windows we get here; (ii) up to the data collection date 
(Feb. 26, 2010), we want to get as many windows as we can. Using the statistical confidence 
level of 99%, we determine the fraction of those windows which reject the Phillips-Perron 
and the Dickey- Fuller unit root tests (Hi), i.e., which qualify as stationary. The results are 
presented in table IIVI We conclude that most of the residuals are found stationary, which 
support the validity of our calibration procedure. 

Previous works have identified the domain of parameters of the calibration of the JLS 
model M which is the most relevant (Johansen and Sornette, 2006; Jiang et al., 2010). 
These conditions, referred to as the LPPL (log-period power law) conditions, are 

B>0; 0.1<m<0.9; 6 < u; < 13; - 1 < C < 1 . (20) 

Imposing that the calibrations obey these LPPL conditions ( 1201) . we find in Table IVl that the 
fraction of the above windows analyzed in Table IIVI which fulfill the stationary conditions 
is significantly increased, augmenting our trust of the quality of the calibration and of the 
relevance of this class of models. 

IV. STATISTICAL COMPARISONS OF THE FOUR GENERALIZED JLS MOD- 
ELS 

A. Standard Wilks test of nested hypotheses assuming independent and normally 
distributed residuals 

Let us consider the five pairs of models with nested structure: (M C Mi), (M C M 2 ), 
(Mi C M 3 ), (M 2 C M 3 ), and (M C M 3 ). Let us denote M, as the model with the smaller 
number of parameters and Mh that with the larger number of parameters. For each pair, we 
use Wilks test of nested hypotheses in terms of the log-likelihood ratios to decide between 
the two hypotheses: 

Hq\ Mi is sufficient and Mh is not necessary. 
Hi. Mi is not sufficient and Mh is needed. 
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We first present in this subsection the tests assuming that the residuals of the calibration 
of the models to the asset price time series are normally and independently distributed. In 
the next subsection, we loosen this restriction. 

For each model Mj, i = 0,1, 2, 3, let us denote the normalized residuals defined by 
expression (fTHj) by Ri(t) and assume that they are i.i.d. Gaussian. For sufficiently large 
time windows, and noting N the number of trading days in the fitted time window [ti;^], 
the Wilks log-likelihood ratio reads 

T = 21og^^ = 2iVln^ + Et = 1 5 (t) _ g£ljlg) ; (21) 

where Ri and 07 (respectively Rh and at) are the residuals and their corresponding standard 
deviation for M t (respectively M h ). 

In the large N limit, and under the above conditions of asymptotic independence and 
normality, the T-statistics is distributed with a xt distribution with k degrees of freedom, 
where k is the difference between the number of parameters in Mh and Mi. We have k = 1 
for the pairs (M ,Ml), (M ,M 2 ), (M 1; M 3 ), (M 2 ,M 3 ), and k = 2 for (M ,M 3 ). The p- 
values associated with the T-statistics given by (12"T]) for each of the five pairs are reported 
in Table IVIl The summary of that table is: 

• Hong Kong Hang Seng index (HSI) from Feb. 1, 1995 to March 13, 1997: Model M 
is never rejected and the standard JLS model is sufficient. 

• S&P 500 index (GSPC) from Sept. 1, 1986 to Aug. 26, 1987: Model M is rejected 
with strong statistical confidence in favor of Mi, M 2 and M 3 . However, when com- 
paring Mi and M 2 to M 3 , we find that M 3 is not necessary. Therefore, we conclude 
that the structure of the S&P 500 index bubble requires the introduction of either 
a fundamental price pi or of a nonlinear crash amplitude as a function of mispricing 
(price for M and M 2 ), but that both ingredients together are not necessary. 

• Shanghai Composite index (SSEC) from Oct. 24, 2008 to July 10, 2009: Only M 3 
improves on M at a confidence level of 92.3% that can be considered as acceptable, 
while Mi and M 2 are not significantly better than Mq for standard confidence levels. 
Consistent with M 3 being rather significantly better than M , it is also better than 
Mi and M 2 , which are themselves not significantly improving on M . There seems to 
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exist both a fundamental value component and a nonlinear over-reaction to mispricing 
in the unfolding of this Chinese bubble. 

B. Comparison between models by bootstrapping to account for non-normality 
and dependence between residuals 

Consider a pair of models (M/ C M/J. Let us assume that Mi is the correct generating 
model of the data. The calibration of Mi to the data gives a specific set of parameters as 
well as a specific realization of residuals. We then use this specification of the model M; 
and its residuals to generate 1000 synthetic time series. A given synthetic time series is 
the calibrated Mi time series on which we add residuals obtained by randomly reshuffling 
the previously obtained residuals. Thus, the 1000 synthetic time series differ from each 
other only by the reshuffling of the residuals. We then calibrate the two models M; and 
Mh on each of these 1000 synthetic time series and calculate the difference of the sum of 
the square of residuals of the fits of these two models. We thus have a list of 1000 different 
d n , n = 1, 1000. Comparing with the corresponding difference d^ t (between Mi and Mh) 
gives us a realistic estimation of the p-value for the null hypothesis that Mi is the correct 
generating model of the data. Specifically, the p- value is the fraction among the 1000 d n 's 
that are larger than d^ t . For instance, if all values d n are smaller than d^ t , we obtain p = 0, 
i.e., it is very improbable that the difference in quality of fit between Mi and Mh results 
solely from the structure of the models and of the residues. We can reject the null and 
conclude that M h is a better necessary model. 

The second test we perform starts with the hypothesis that the true generating process is 
Mh- Thus, the 1000 synthetic time series are now generated by using model Mh calibrated 
on the data and its residuals. Then, the p-value for this null is determined as the fraction 
among the 1000 d n 's that are smaller than d^t- 

Table IVDl summarizes the results, which improve on those shown in Table IVJ by relaxing 
the conditions of normality and of independence between the daily residuals of the calibra- 
tion. The bootstraps are performed by reshuffling the residuals of the fit "every day" or in 
blocks of 25 continuous days ( "every 25 days" ), which is in blocks of 25 continuous days. The 
later allows us to keep the dependence structure over 25 days to test its possible impact on 
the p- values. Reshuffling every day destroys any dependence in the residuals, while keeping 
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their one-point (possibly non-Gaussian) statistics. 

For HSI, taking into account the dependence structure of the residuals up to 25 days 
confirm the results already found in Table [VI] that the standard JLS model M is sufficient 
to explain the observed financial bubble. For GSPC, the results also confirm those of the 
Wilks test in Table IVI[ that Mi and M 2 improve significantly on M , while M 3 is not 
necessary. For SSEC, also in agreement with Table IVI| model M3 is found to be the best 
and to be significant at the 95% confidence level. 

Overall, these tests confirm that the generalized JLS models seem to provide useful im- 
provements over the standard JLS model, both in terms of their explanatory power and in 
the extraction of additional information, specifically the fundamental price pi and a possible 
nonlinear dependence of the crash amplitude as a function of mispricing. 

V. CONCLUSION 

In this paper, we generalized the JLS model by inferring the fundamental value and crash 
nonlinearity from bubble calibration. In the generalized model, one can not only predict the 
crash time of a stock, but also estimate the fundamental value of that stock. Besides, the 
crash nonlinearity can also be estimated. 

Three historical bubbles from different markets are tested by the generalized models. All 
the results suggest that the new models perform very well in describing bubbles, predicting 
crash time and estimating fundamental value and the crash nonlinearity. 

The performance of the new models is tested both under the Gaussian and non-Gaussian 
residual assumptions. Under the Gaussian residual assumption, nested hypothesis testing 
with the Wilks statistics is used and the p- values suggest models with more parameters are 
necessary. Under non-Gaussian residual assumption, we use bootstrap method and get the 
type I and II errors of the hypothesis. All those tests confirm that the generalized JLS 
models provide useful improvements over the standard JLS model. 
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7 


-^-C[2months] 




RMS 


M 


27-Jul-1997 


10 


0.19 


6.97 


0.00 








0.46 


0.62 


0.0320 


Mi 


ll-Jul-1997 


26 


0.25 


6.63 


0.78 


0.20 


0.10 




0.52 


0.69 


0.0320 


M 2 


12-Jul-1997 


25 


0.03 


6.64 


0.87 






0.13 


0.46 


0.62 


0.0319 


M 3 


12-Jul-1997 


25 


0.03 


6.65 


4.04 


0.29 


0.15 


0.11 


0.54 


0.73 


0.0319 


K 


09-Jul-1997 


28 


0.39 


6.53 


3.30 








0.41 


0.55 


0.0323 



TABLE I: Results of the calibration of models Mq — M3 for the Hong Kong Hang Seng index (HSI) 
from Feb. 1, 1995 to March 13, 1997. t c is the critical time of a given model corresponding to 
the end of the bubble and the time at which the crash is the most probable. t\ is the beginning 
of the fitting interval. t p is the time when the asset value peaks before the crash. The relative 
amplitude of the crash following the peak of the bubble is given by -RCr 2mont h s ] and i?C max , which 
are calculated using expression (fl9"j) from the following drawdown amplitudes: (i) -D-Dpmonths] is 
the two-months drop measured from the peak; (ii) DD max is the peak-to-valley drawdown from 
the peak to the minimum of the asset price. RMS is the root mean square of the distances between 
historical prices and the model values, i.e., the square root of the sum of the squares of terms given 
by (|18p . for t going from t± to t 2 , where t 2 is the last date of the time window used for the analyses. 
The model denoted Mq corresponds to model Mq with a different calibration method, as explained 
in the text. 



GSPC 


tc 




m 


(jj 





Pf(tl) 

p(tl) 


Pf(tp) 
p(t P ) 


7 


-^C[2months] 


-^Cmax 


RMS 


M 


13-Sep-1987 


19 


0.70 


6.62 


0.00 








0.34 


0.35 


0.0196 


Mi 


03-Sep-1987 


9 


0.68 


6.10 


0.00 


0.18 


0.14 




0.40 


0.40 


0.0190 


M 2 


05-Sep-1987 


11 


0.63 


6.09 


0.00 






0.72 


0.34 


0.35 


0.0191 


M 3 


03-Sep-1987 


9 


0.64 


6.10 


0.00 


0.18 


0.14 


0.64 


0.40 


0.40 


0.0190 


K 


26-Aug-1987 


1 


0.68 


5.59 


0.14 








0.32 


0.33 


0.0187 



TABLE II: Same as Table U for the S&P 500 index (GSPC) from Sept. 1, 1986 to Aug. 26, 1987. 
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SSEC 




I | 


m 




<f> 


Pf(ti) 
p(ti) 


p/(*p) 
p(tp) 


7 


-RCpmonths] 




RMS 


M 


29-Jul-2009 


2 


0.63 


16.60 


0.00 








0.23 


0.23 


0.0258 


Mi 


24-Jul-2009 


3 


0.77 


15.86 


1.94 


0.36 


0.19 




0.29 


0.29 


0.0256 


M 2 


21-Jul-2009 


6 


0.69 


15.52 


6.28 






0.99 


0.23 


0.23 


0.0257 


M 3 


24-Jul-2009 


3 


0.65 


15.96 


2.49 


0.92 


0.49 


0.20 


0.45 


0.45 


0.0254 


Mo 


24-Jul-2009 


3 


0.68 


15.86 


5.12 








0.23 


0.23 


0.0256 



TABLE III: Same as Table[Qfor the Shanghai Composite index (SSEC) from Oct. 24, 2008 to July 
10, 2009. 



Percentage of stationary 


M 


Mi 


M 2 


M 3 


303 HSI windows from Jan. 1, 1987 to Feb. 25, 2010, length 550. 


Phillips-Perron 


96.7% 


98.0% 


96.7% 


97.7% 


Dickey-Fuller 


96.7% 


98.0% 


96.7% 


97.7% 


800 GSPC windows from Feb. 2, 1954 to Feb. 10, 2010, length 250. 


Phillips-Perron 


90.6% 


91.0% 


91.8% 


91.8% 


Dickey-Fuller 


90.6% 


91.0% 


91.8% 


91.8% 


167 SSEC windows from Aug. 3, 1997 to Jan. 22, 2010, length 175. 


Phillips-Perron 


96.4% 


97.0% 


96.4% 


97.0% 


Dickey-Fuller 


96.4% 


97.0% 


96.4% 


97.0% 



TABLE IV: Percentage of stationary residuals for the Phillips-Perron and Dickey-Fuller tests. 
Significance level: 99%. 
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Percentage of stationary under LPPL constrains 


M 


M 1 


M 2 


M 3 


303 HSI windows from Jan. 1, 1987 to Feb. 25, 2010, length 550. 


Plppl 


0.99% 


0.99% 


2.64% 


1.98% 


Phillips-Perron 


100% 


100% 


100% 


100% 


Dickey- Fuller 


100% 


100% 


100% 


100% 


800 GSPC windows from Feb. 2, 1954 to Feb. 10, 2010, length 250. 


Plppl 


4.50% 


6.00% 


4.50% 


5.87% 


Phillips-Perron 


95.7% 


100% 


97.9% 


100% 


Dickey- Fuller 


95.7% 


100% 


97.9% 


100% 


167 SSEC windows from Aug. 3, 1997 to Jan. 22, 2010, length 175. 


Plppl 


4.19% 


4.79% 


8.38% 


9.58% 


Phillips-Perron 


93.8% 


92.9% 


100% 


100% 


Dickey- Fuller 


93.8% 


92.9% 


100% 


100% 



TABLE V: Percentage of stationary residuals, as qualified by the Phillips-Perron and Dickey- 
Fuller tests, which obey the LPPL conditions (|20p . The variable Plppl gives the fraction of fits 
that satisfy the conditions (|20p . independently of whether their residuals are stationary or not. 
Significance level: 99%. 





(M , M x ) 


(M ,M 2 ) 


(M 1) M 3 ) 


(M 2 ,M 3 ) 


(M ,M 3 ) 


HSI 


0.4710 


0.2210 


0.3221 


0.9626 


0.4723 


GSPC 


0.0003 


0.0006 


0.7930 


0.2150 


0.0012 


SSEC 


0.1405 


0.2494 


0.0863 


0.0516 


0.0775 



TABLE VI: p- value of the null hypothesis Hq for pairs of models (M;,M/j) that Mi is sufficient 
and Mh is not necessary, using Wilks log- likelihood ratio statistics. Low p- value indicates the 
improvement of Mh compared to M/ is significant and Hq is rejected. 
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(M , Mi) 


(M ,M 2 ) 


(M ,M 3 ) 


(M 1 ,M 3 ) 


(M 2 ,M 3 ) 


HSI shuffle every day 


Mi true 











0.05 


0.75 


M h true 











0.10 


0.60 


HSI shuffle every 25 days 


Mi true 


0.46 


0.20 


0.42 


0.26 


0.76 


M h true 


0.42 


0.12 


0.38 


0.18 


0.70 


GSPC shuffle every day 


Mi true 











0.35 


0.45 


M h true 


0.05 








0.45 


0.40 


GSPC shuffle every 25 days 


Mi true 


0.05 





0.05 


0.40 


0.50 


M h true 











0.50 


0.45 


SSEC shuffle every day 


Mi true 











0.05 


0.35 


M h true 





0.05 





0.05 


0.50 


SSEC shuffle every 25 days 


Mi true 


0.14 


0.08 


0.04 


0.04 


0.38 


M ft true 


0.12 


0.06 


0.06 


0.08 


0.40 



TABLE VII: p- values calculated by bootstraping (see text for explanation). Low p- value indicates 
the improvement of Mh compared to Mi is significant. 
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HSI fit results. Fitting interval from 01-Feb-1995 to 13-Mar-1997 
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« History index price 
-M Q , p 1 =0,y=1 

M r P^O, y = 1 
M 2 , p 1= 0, ye (0,1] 
"M 3 , p^O, ye (0,1] 
— M ' fit with log-price 
--fundamental price of M 
— fundamental price of NL 



16-Jan- 




1995 



07-Aug-1995 



27-Feb-1996 



18-Sep-1996 



10-Apr-1997 



31 -Oct- 1997 



FIG. 1: Calibration of the different models to the Hong Kong Hang Seng Index. The fit interval is 
shown with vertical black dashed lines. The fitted critical time t c when the crash is most probable 
according the modified JLS models are marked by vertical dashed lines with the same color as the 
corresponding fits with each model. The historical close prices are shown as blue empty circles. 
The fundamental price for M\ and M3 are also shown as the almost horizontal dashed lines (beware 
of the break in the vertical scales for low values). 
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GSPC fit results. Fitting interval from 01-Sep-1986 to 26-Aug-1987 
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• History index price 

— M Q , P 1 =0,y=1 

M r p^O, y= 1 
M 2 , p 1= 0, ye (0,1] 
"--M 3 , p^O, ye (0,1] 

— M Q ', fit with log-price 

— - -fundamental price of M 
—-fundamental price of 



21 -Aug- 1986 



16-Nov-1986 



11-Feb-1987 



09-May-1987 



04-Aug-1987 



30-Oct-1987 



FIG. 2: Same as figure Q] for the S & P 500 Index. 
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SSEC fit results. Fitting interval from 24-Oct-2008 to 10-Jul-2009 

• History index price 
— M Q , p 1= 0,y=1 

M r p^0, y= 1 




0, ye (0,1] 



---M 3 , p^0, ye (0,1] 

- M Q ', fit with log-price 

- - -fundamental price of M 1 

- - fundamental price of M 



15-Dec-2008 15-Feb-2009 18-Apr-2009 



19-Jun-2009 



20-Aug-2009 



FIG. 3: Same as figure [T] for the Shanghai Composite Index. 
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